188 ◾ Bioinformatics
We can plot the BCV against gene abundance (in log2 counts per million) using
“plotBCV(y)” function. As described above, the BCV is the square root of the dispersion
parameter in the negative binomial model.
jpeg(‘myBCVPlot.jpg’)
plotBCV(yNorm, pch=16, cex=1.2)
dev.off()
The above script plots the biological coefficient of variations (based on the three types of
dispersions) against the average abundance of each gene. As shown in Figure 5.14, the plot
shows the square root estimates of the common, trended, and tagwise negative binomial
dispersions.
For RNA-Seq count data, the negative binomial dispersions tend to be higher for genes
with very low counts and the dispersion trend tends to decrease smoothly with the abun-
dance (CPM) increase and becomes asymptotic to a constant value for genes with a larger
abundance.
FIGURE 5.13 Printing dispersions.
FIGURE 5.14 Biological CV plot.